Economic Regime Classification

Yijia Yu

30/07/2020

Economic Indices

Totally we have totally 13 indices reflecting US economic market performance in this study. Those indices are listed in the table below, in which you will see the name of each index and what it represent for.

Index Description
EHGDUS Index US Real GDP (QoQ, %, SAAR)
CPI YOY Index US CPI (inflation) Urban Consumer YoY NSA
CPI CHNG Index US CPI (inflation) Urban Consumer MoM SA
EHUPUS Index US Unemployment Rate (%)
IP CHNG Index US Industrial Production MoM SA
NHSPATOT Index Private Housing Units Permits Total SAAR (thousands)
NFP TCH Index US Employment on Nonfarm Payrolls Total (SA, Net Monthly Change, thousands)
TMNOCHNG Index US Manufacturing New Orders Total MoM SA
LEI TOTL Index Conference Board US Leading Economic Indicator
PITL YOY Index US Personal Income YoY SA
CICRTOT Index Federal Resrve Consumer Credit Total Net Change SA
USCABAL Index US Nominal Account Balance (Billions USD)
M2% YOY Index Federal Reserve Money Supply M2 YoY % Change

Preparing Data

Since the original data are mixed-frequency time series. To solve this problem, the frequency of all indices used in this study was chosen as quarterly. Besides, to solve the inconsistency of time length, i.e. different starting time of all indices, I used the subset series of the longest common length which starts in 1960 Q1 and ends in 2020 Q1.

Preparing Data

All index series are standardized before we do any statistical analysis, that is, centering each series (makes its mean equal to 0) and multiplying a constant to each series to make its variances equal to 1. For example, the value scale of Private Housing Units Permits Total SAAR (thousands) is much greater than else. By standardized the data series, we may ignore how magnificent of its value but the relationship between it and other series. Taking advantages of those internal relationship, we can group economic time periods into different regimes and learn how to make strategies for different economic regimes.

Data Correlation

Through looking into the correlation between each pairs of standardized indices, we can detect that some of indices are closely correlated with each other. Since we already have 13 indices Therefore, we can use some dimension reduction techniques in statistics like PCA.

## The correlation between US Real GDP (QoQ, %, SAAR) and 
##  US Manufacturing New Orders Total MoM SA is 0.44 .

## The correlation between US Real GDP (QoQ, %, SAAR) and 
##  US Nominal Account Balance (Billions USD) is 0.516 .

## The correlation between US CPI (inflation) Urban Consumer YoY NSA and 
##  US Unemployment Rate (%) is 0.719 .

## The correlation between US CPI (inflation) Urban Consumer YoY NSA and 
##  US Industrial Production MoM SA is 0.709 .

## The correlation between US CPI (inflation) Urban Consumer MoM SA and 
##  US Employment on Nonfarm Payrolls Total (SA, Net Monthly Change, thousands) is 0.662 .

## The correlation between US Unemployment Rate (%) and 
##  US Industrial Production MoM SA is 0.516 .

## The correlation between US Industrial Production MoM SA and 
##  Conference Board US Leading Economic Indicator is 0.495 .

## The correlation between US Manufacturing New Orders Total MoM SA and 
##  US Nominal Account Balance (Billions USD) is 0.681 .

## The correlation between US Manufacturing New Orders Total MoM SA and 
##  Federal Reserve Money Supply M2 YoY % Change is 0.501 .

## The correlation between US Nominal Account Balance (Billions USD) and 
##  Federal Reserve Money Supply M2 YoY % Change is 0.428 .

Labelling

According to List of recessions in the US, all quarters involved in the listed US financial recession periods were labeled as “recession”.

According to List of expansions in the US, all quarters involved in the listed US financial expansion periods were labeled as “expansion”.

Principle Component Analaysis (PCA)

Monthly data is too repetitious for PCA, as we aim to looking into different historic economic regimes rather than a specific month.

Variable contributions to PC

The plot below shows the contributions of variables in accounting for the variability to the top 2 principal components, that is, the higher contribution (%) of one economic index in this graph, the greater necessity to including this index into our analysis.

Some highly correlated indices result trivial variables which do not contribute variance to first two principle components.

Graph of individuals

Individuals with a similar profile are grouped together.

Graph of variables

Positive correlated variables point to the same side of the plot. Negative correlated variables point to opposite sides of the graph.

Biplot of both individuals and variables

How about removing those redundant variables?

2020 is extremely unusual, far away from any previous annual index performance. How about removing the data of 2020?

Now, our data become more dispersive.

Alternative way: K-means

K-means clustering (MacQueen 1967) is one of the most commonly used unsupervised machine learning algorithm for partitioning a given data set into a set of k groups (i.e. k clusters), where k represents the number of groups pre-specified by the analyst. It classifies objects in multiple groups (i.e., clusters), such that objects within the same cluster are as similar as possible (i.e., high intra-class similarity), whereas objects from different clusters are as dissimilar as possible (i.e., low inter-class similarity).

Quarterly Data Analysis

Now, we repeated the analysis using quarterly data